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Estimates of cosmological parameters using galaxy clusters have the scatter in the observable at a given mass as a fundamental 
parameter. This work computes the amplitude of the scatter for a newly introduced mass proxy, the product of the cluster total 
luminosity times the mass-to-light ratio, usually referred as stellar mass. The analysis of 12 galaxy clusters with excellent total masses 
shows a tight correlation between the stellar mass, or stellar fraction, and total mass within r sm with negligible intrinsic scatter: the 
90% upper limit is 0.06 dex, the posterior mean is 0.027 dex. This scatter is similar to the one of best-determined mass proxies, such as 
Y x , i.e. the product of X-ray temperature and gas mass. The size of the cluster sample used to determine the intrinsic scatter is small, 
as in previous works proposing low-scatter proxies because very accurate masses are needed to infer very small values of intrinsic 
scatter. Three-quarters of the studied clusters have IgM Sj 14 M B , which is advantageous from a cosmological perspective because 
these clusters are far more abundant than more massive clusters. At the difference of other mass proxies such as Y x , stellar mass can 
be determined with survey data up to at least z = 0.9 using upcoming optical near-infrared surveys, such as DES and Euclid, or even 
with currently available surveys, covering however smaller solid angles. On the other end, the uncertainty about the predicted mass of 
a single cluster is large, 0.21 to 0.32 dex, depending on cluster richness. This is largely because the proxy itself has as 0.10 dex errors 
for clusters of IgM S 14 M Q mass. 

Key words. Galaxies: clusters: general — Galaxies: stellar content — Galaxies: luminosity function, mass function — Cosmology: 
cosmological parameters — Cosmology: observations methods: statistical 



1. Introduction 

Galaxy clusters are the most massive bound structures. These 
structures, as well galaxy groups, in this work are called halos, 
a term that does not require an arbitrary boundary between the 
two types of objects. Halos are both cosmological probes and 
giant physics laboratories. Halos are multicomponent systems 
formed mostly by dark matter and by baryons in the form of gas 
and stars. The proportion of gas (intra-cluster medium) and star 
components is 10:1 at IgM ~ 14.5 M Q , but 1:1 at IgM -13.5 
M (e.g. Andreon 2010 and reference therein) and 1:2 at IgM ~ 
12.5 - 13 M (Dai et al. 2010; Humphrey et al. 2011, 2012). 
The cluster mass function is steep (e.g. Press & Schecther 1974; 
Jenkins et al. 2001), and therefore most halos have low masses 
and thus comparable amounts of stars and gas. Therefore, in all 
but the shallowest (in mass) surveys, galaxies and gas contain 
similar amounts of baryons. 

The efficient use of clusters as cosmological probes requires 
a low-scatter mass proxy because cosmological constraints be- 
come looser and looser with increasing scatter between mass 
and mass proxy (e.g. Lima & Hu 2005). Quantities derived from 
survey data have generally large scatter: richness and not-core- 
excised X-ray luminosities have a 0.19 dex intrinsic scatter (rich- 
ness: Andreon & Hurn 2010, X-ray luminosity: Vikhlinin et al. 
2009a, Mantz et al. 2010a, see Andreon & Hurn 2010 for a com- 
parison of the predicted mass uncertainties of these two prox- 
ies). Instead, the intrinsic scatter of the SZ mass proxy has yet 
to be robustly measured from the data (Allen, Evrard, & Mantz 
201 1). Proxies based on higher quality data perform better: core- 
excised X-ray luminosities have a small, and for this reason 
poorly determined, intrinsic scatter of 0.015-0.025 dex (Mantz 
et al. 2010a) above IgM ~ 14.5 M Q and an unknown scatter be- 



low; Yx shows a similarly small and poorly determined intrinsic 
scatter of 0.052 dex above IgM ~ 14.5 M (Mantz et al. 2010a) 
or 0.07 above IgM ~ 14 M (Arnaud et al. 2007). 

More complex proxies, which combine information from, 
say, X-ray and SZ data, should perform even better than those 
using only one of them. In practice, one should have high-quality 
data for both proxies, not just for one of them. This usually 
strongly reduces the usable sample and makes it harder to de- 
rive the selection function needed to determine of cosmological 
constraints. 

The scatter in observable at a given mass, and not the other 
way around, is a parameter appearing explicitly in the estimates 
of cosmological parameters based on galaxy clusters counts and 
is thus a fundamental parameter. In fact, a- proxy \M has been used 
in past cosmological estimates using cluster counts (Vikhlinin et 
al. 2009, Mantz et al. 2010b, Rozo et al. 2010). To understand 
why this scatter, and not the other way around, appears in cos- 
mological estimates one must recall that cosmological parame- 
ters are constrained by changing them until the predicted abun- 
dance, as a function of the observable, matches the observed dis- 
tribution. Thus, one needs to go from the mass function, p(M), 
to the predicted distribution in the observable, p(proxy). Since 
p(proxy) oc p(proxy\M) * p(M) (Bayes' theorem, see also Lima 
& Hu 2005 and Rozo et al. 2010 for its cosmological applica- 
tion), p(proxy\M) (whose minimal parametrisation is given by 
the scatter at a given mass, 0~ prO xy\M) is needed. One of the aims 
of this work is to compute this quantity and compare it to the 
scatter of other mass proxies, completing the similar exercise 
done in the review paper of Allen et al. (2011) for other mass 
proxies (their Sect. 3.3.4) 
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It is worth emphasizing that the scatter computed the other 
way around, in mass at a given observable, is also an interesting 
quantity because it is an approximation of how well one may es- 
timate the mass of a single cluster. To be precise, o~M\ P roxy alone 
is not enough. The whole probability distribution p(M\proxy) is 
needecQ because one should also at least account for the proxy 
uncertainty and for the uncertainties in the mean relation be- 
tween mass and proxy. The width of the posterior predictive 
distribution encapsulates all these uncertainties (including less 
obvious ones, such as errors coming from extrapolations) and 
has been used in Andreon & Hurn (2010) to compare the per- 
formances of Lx and «200 as estimators of the mass of individual 
clusters. 

We emphasise that, once the intrinsic scatter is smaller than 
the mass error, its value is poorly determined (for all mass prox- 
ies) and should be read as an upper limit because of the uncer- 
tainty of the mass error and of the likelihood function (often 
assumed to be Gaussian). Small changes of them imply large 
changes in the derived intrinsic scatter when the latter is compa- 
rable to mass errors. So, although in principle an intrinsic scatter 
much smaller than the mass error can be determined from data 
of arbitrary precision, in practice values smaller than mass errors 
should read as upper limits. 

In this paper, we introduce a new mass proxy, the stellar 
mass, which shows a negligible small intrinsic scatter with hy- 
drostatic mass. Stellar mass is derived as in previous works 
(e.g. Gonzalez, Zaritsky, & Zabludoff 2007; Andreon 2010): it 
is given by the stellar M/L ratio times the total cluster luminos- 
ity. Three-quarters of the studied clusters have IgM S= 14 M Q , 
which is advantageous from a cosmological perspective because 
these clusters are far more abundant than more massive clusters. 
Furthermore, we show that stellar masses can be derived from 
imaging data, making it possible to determine the mass proxy 
up to at least z = 0.9 with both current and upcoming imaging 
surveys. 

An optical mass proxy with small scatter is of paramount 
importance, given the current large investment in optical and 
near-infrared surveys, such as the Dark Energy Survey (DES0) 
(Abbott et al. 2005), Euclid3 (Laureijs et al. 2011), and the 
planned Large Sinoptic Survey Telescope (Ivezic et al. 2008). 
All these surveys cover thousands of square degrees in several 
bands with a depth appropriate to derive the proxy value to z = 1 
and beyond (see Sect. 3.3 for details). 

Throughout this paper, we assume Q.m = 0.3, Qa = 0.7, 
and //o = 70 km s _1 Mpc~'. Magnitudes are quoted in their 
native system (quasi-AB for Sloan Digital Sky Survey (SDSS) 
magnitudes). All logarithms in this work are on base ten. 

2. Data and sample 

We started from clusters in Vikhlinin et al. (2006) and Sun et 
al. (2009) with: a) hydrostatic mass errors lower than 25% to 
retain a sample of clusters with accurately measured masses. In 
practice, because of the other constraints, the typical mass error 
of halos retained in the sample is 0.05 dex; b) inside the SDSS 
8''' data release (Aihara et al. 2011); c) with redshift between 
0.02 < z < 0.14, and d) mass larger than 10 13 2 M Q to keep 



1 An appreciation of the difference between p(x\y) and p(y\x) may, 
following D'Agostini (2012), be obtained by replacing x with woman 
and y with senator, obtaining x 10 - 60% for the former (in most coun- 
tries) and <K 0.1% for the latter. 

2 http://www.darkenergysurvey.org/ 

3 http://www.euclid-ec.org 



clusters for which the stellar mass can be accurately measured. 
In practice, the redshift range of the retained sample, listed in 
Table 1, is narrower than imposed (0.02 < z < 0.078) be- 
cause of the other constraints. The retained sample consists of 
12 unique clusters plus two repeats, i.e. objects listed in both 
catalogs (MKW4 and Abell 1991) and analysed twice to check 
analysis-dependent systematics. 

The studied sample is based on clusters with available 
masses, and we have not filtered it by using any optical clus- 
ter property, such as the presence of a prominent red sequence 
or a centrally located brightest cluster galaxy. Therefore, while 
we are sure that we have not introduced a selection bias, both 
the starting and the retained samples have an unknown repre- 
sentativity. Nevertheless, the starting sample should not be com- 
pletely unrepresentative, because it has been successfully used 
to calibrate the mass-Tx scaling relation for cosmological esti- 
mates (e.g. Vikhlinin et al. 2009). 

Measurements are performed within the r^oo radiufl listed 
in Vikhlinin et al. (2006) and Sun et al. (2009), and within the 
virial radius, r2oo, derived from rsoo and C500 (also listed in these 
papers). For three clusters (3C442, A2092, and NGC6269), the 
r2oo computation assumes a concentration index of four, lacking 
C500 for these two clusters in these papers. 

The analysis of the optical data strictly follows Andreon 
(2010, A10 hereafter). The basic data used in our analysis are 
g and r photometry from SDSS, down to r = 19 mag. The latter 
is the value where the star/galaxy separation becomes uncertain 
(e.g. Lupton et al. 2002) and is much brighter than the SDSS 
completeness limit (e.g. Ivezic et al. 2002). Specifically, we use 
"cmodel" magnitudes for "total" magnitudes and "model" mag- 
nitudes for colours. We computed the mass in stars from the clus- 
ter total luminosity, the latter computed as the integral of L times 
the luminosity function of red galaxies. We adopted a Bayesian 
approach, as done for other clusters (e.g. Andreon 2006, 2010, 
Andreon et al. 2006, 2008b, Meyers et al. 2012), and we account 
for the background (galaxies in the cluster line-of-sight), which 
is estimated outside the cluster turnaround radius. We adopted 
a Schechter (1976) luminosity function and the likelihood ex- 
pression given in Andreon, Punzi & Grado (2005), which is 
an extension of the Sandage, Tammann & Yahil (1979) likeli- 
hood expression for the case where a background is present. The 
Bayesian approach allows us to easily propagate uncertainties 
and their covariance into the integral of the luminosity function 
and into the stellar mass. For example, in contrast to all other 
authors (except Meyers et al. 2012), we do not assume to com- 
plete knowledge of the faint end slope of luminosity function and 
instead marginalise over it, allowing us to propagate this uncer- 
tainty in derived quantities. As a visual check, we also computed 
the luminosity function by binning galaxies in magnitude bins 
(e.g. Zwicky 1957, Oemler 1974, and many papers since then). 

In the luminosity function computation, attention should be 
paid to saturated stars misclassified as galaxies (we inspected 
the SDSS images and removed them by hand) and to the bright- 
est cluster galaxy (BCG), which might not be drawn from the 
Schechter (1976) function (e.g. Tremaine & Richstone 1977). 
To deal the latest issue, we fit all galaxies, excluding the BCG, 
and added its luminosity contribution separately. Finally, we 
removed by hand very bright (much brighter and larger than 
the BCG) unrelated galaxies that were spectroscopically in the 
fore/background of the cluster. 



4 r A is the radius within which the enclosed average mass density is 
A times the critical density. 
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In this paper, we define as red galaxies those within 0. 1 red- 
ward and 0.2 blueward in g - r of the colour-magnitude relation, 
as in Andreon & Hurn (2010) and A 10, and in agreement with 
works mentioned there. 

Because of the shallow nature of the SDSS, it misses the 
flux coming from the galaxy outer regions (e.g. Blanton et al. 
2001, Andreon 2002). For this reason we corrected the measured 
flux by 15% (Blanton et al. 2001). To compute the luminosity 
in a sphere from the measured values (derived in a cylinder), 
we assumed a Navarro, Frank & White (1997) profile. Finally, 
luminosities are converted in stellar mass by adopting the M/L 
value derived by Cappellari et al. (2006). 

To check whether our stellar mass are over/underestimated, 
we performed an approximate calculation using simulations, that 
account only for the effect of the finite sample size of cluster 
galaxies, and of the background subtraction because these two 
terms were found to be the main sources of error. First, we sim- 
ulated 1000 clusters, all composed of 73 galaxies whose lumi- 
nosities are drawn from a Schecther (1976) function with slope 
a = -1. These are best-fit values of NGC6269. As for real 
NGC6269 data, luminosities are drawn down to M* + 5. To ob- 
tain the total luminosity of each simulated cluster, we summed 
luminosities of the simulated galaxies (Schechter's draws). Now, 
we needed to simulate the uncertainty associated to the back- 
ground subtraction. We took a power-law to describe the back- 
ground counts with best-fit parameters measured for NGC6269 
background area (slope 0.4 and 27 background galaxies per unit 
cluster solid angle). Next, we added the luminosity of a realiza- 
tion of the background and removed the luminosity of another 
realization to get the total net cluster luminosity. Since the num- 
ber of background galaxies is subject to Poisson fluctuations, 
the number of draws for the two background realizations is al- 
lowed to fluctuate Poissonianly. Finally, since the background 
for NGC6269 is estimated on a solid angle 12 times larger than 
the cluster area, our simulation also modelled this. The total net 
luminosity showed a scatter, from simulation to simulation, of 
0.11 dex. For NGC6269 we derived 0.12 dex from real data 
(Table 2), the latter also accounting for other (minor) sources 
of error not modelled in this simulation. The two estimates are 
close enough not to warrant a more complex simulation and 
show that our stellar mass errors are accurate. 

Several of our steps are only useful for putting the observ- 
able, the stellar mass, in standard units and can be safely re- 
moved without affecting the quality (intrinsic scatter) of the pro- 
posed mass proxy. We can skip the conversion from luminosity 
to mass because this is a single number and therefore has no 
effect on the scatter around the mean relation (it only changes 
the intercept, i.e. the unit of the quantity being measured). In the 
same vein, we may also remove the correction for the missing 
flux, which again only affects the intercept because it is a sin- 
gle number. Similarly, the conversion from flux in a cylinder to 
flux into a sphere is a multiplicative term with a negligible de- 
pendency on concentration (it then affects only the intercept). 
Because of the depth in the cluster rest-frame of the photometry 
used, the light emitted by r > 19 mag galaxies is also negligible 
and could be ignored. In short, stellar mass observationally mea- 
sures the amount of light emitted from red detected galaxies, and 
several of our operations are only useful to obtain standard units 
for this quantity. 

As mentioned, these "corrections" affect the absolute value 
of the intercept of the mass-proxy relation. In other observa- 
tional conditions (e.g. for shallow observations of distant clus- 
ters), these corrections may take different values and differ 
from cluster to cluster. Therefore, we prefer to characterise 



Table 1. Stellar masses. 



ID 


logio M 


*' r 500 


logio M 






[M ] 




[M e ] 






Abell 160 


12.33 ± 


0.12 


12.48 


+ 


0.10 


Abell 1177 


12.05 ± 


0.15 


12.10 


± 


0.16 


Abell 1795 


12.66 ± 


0.05 


12.80 


+ 


0.05 


Abell 1991 


12.46 ± 


0.12 


12.64 


+ 


0.09 


Abell 2029 


12.59 ± 


0.08 


12.70 


+ 


0.08 


Abell 2092 


12.34 + 


0.10 


12.44 


+ 


0.08 


MKW4 


12.25 ± 


0.14 


12.40 


± 


0.13 


NGC 4104 


12.08 ± 


0.20 


12.12 


+ 


0.25 


NGC 5098 


12.11 ± 


0.15 


12.29 


+ 


0.12 


NGC 6269 


12.39 + 


0.12 


12.58 


± 


0.11 


RX J1022+3830 


12.43 + 


0.16 


12.51 


+ 


0.11 


3C442A 


11.97 ± 


0.14 


12.00 


+ 


0.14 


Abell 1991 


12.48 ± 


0.11 


12.66 


+ 


0.09 


MKW4 


12.19 ± 


0.15 


12.27 


+ 


0.17 



The last two lines indicate independently derived estimates based on 
partially independent data. 

the proxy "stellar mass" rather than the observation-dependent 
raw cluster luminosity, which likely has a larger scatter with 
mass (for widely varying observational data) and an observation- 
dependent intercept. This choice also simplifies the comparison 
with recent works. 

3. Results 

Table 1 gives the stellar masses found within 7-500 and r2oo and 
their errors. Typically, stellar masses of clusters in our sample 
have 0. 1 1 dex errors. Two clusters appeared twice in our sample 
and were independently analysed based on partially independent 
data (e.g. different background regions, partially different clus- 
ter regions because of the numerically different 7-500 values, in- 
dependent fits, different csoo). We found nearly identical stellar 
masses showing the negligible effect of non-modelled sources 
of error. Three clusters (Abell 160, MKW 4, and RXJ1022) are 
in common with A10 and have nearly identical stellar masses 
within r2oo- A10 used a former release of the SDSS, which im- 
plements different types of magnitudes, different r^x) values (de- 
rived using a caustic analysis), and different background regions. 

In order to fit the trend between stellar and total mass, we 
use a standard Bayesian fitting model. Essentially, the model as- 
sumes that the true stellar mass and true halo mass are linearly 
related with some intrinsic scatter. However, rather than having 
these true values, we have noisy measurements of both stellar 
mass and halo mass, with noise amplitude different from point 
to point. The code to perform the computation is given in the 
Appendix A of A10 (ses also Andreon 2006, 2008; Andreon et 
al. 2006, 2008a, 2008b; Andreon & Hurn 2010; Kelly 2007). 

Using the (fitting) model above, we found for our sample of 
12 clusters: 

lgM^ rm = (0.38 + 0.07) (logM 5 oo - 14.5) + 12.53 ± 0.04 . 

Unless otherwise stated, results of the statistical computations 
are quoted in the form x + y, where x is the posterior mean and y 
is the posterior standard deviation. 

Figure 1 shows the relation between stellar mass and halo 
mass, observed data, the mean scaling and its 68% uncertainty 
and the mean intrinsic scatter around the mean relation. The in- 
tercept and slope marginals are well approximated by Gaussians. 
The (posterior) probability distribution of the intrinsic scatter 
cr sca , is an exponentially decreasing function (Fig. 2), i.e. the in- 
trinsic scatter is too small to be reliably measured with the data 
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13.5 
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12.5 
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1 1.5 

13.5 14 14.5 15 
log M500 [M s ] 

Fig. 1. Stellar vs total mass. The mean relation between stellar 
mass and halo mass is marked with a solid line, its 68% un- 
certainty is shaded (in yellow). The dashed lines show the mean 
relation plus or minus the (posterior mean) intrinsic scatter a scm ■ 
Error bars on the data points represent observed errors for both 
variables. 




0.1 0.2 0.3 
intr. scatter o\ 



scatt 



Fig. 2. Posterior probability distribution of the intrinsic scatter 

O~lgM+\M 500 - 



on hand. The same conclusion may also qualitatively inferred by 
noting that the mean model crosses the error bars of almost all 
points. Data are adequately described by the model: we gener- 
ated 5000 fake data sets from the model, computed their x 2 , and 
found that 71% of them show a x 1 larger than the (true) data 
on hand (an extreme p- value, say 55 1% or ^ 99% would call 
for a model revision). In this p-value computation, we accounted 
for the uncertainty of the regression parameters (intercept, slope, 
and intrinsic scatter) by adopting Bayesian p-values (Gelman et 
al. 2004, Andreon 2011a). 

The 90% upper limit to the intrinsic stellar mass scatter at a 
given halo mass, cr sca , = o"^ My | M500 , is 0.06 dex. The mean and 
median values are 0.027 and 0.017 dex. 

As mentioned in Sect. 2, we also measured stellar masses 
within r2oo- Using the (fitting) model above, we found for our 
sample of 12 clusters: 

lgM^ rim = (0.38 + 0.07) (logM 50 o - 14.5) + 12.65 + 0.05 

Figure 3 shows the relation between stellar mass and halo 
mass, observed data, the mean scaling and its 68% uncertainty 
and the mean intrinsic scatter around the mean relation for these 
stellar masses. 



O 



13.5 



13 



12.5 



12 




11.5 

13.5 14 14.5 
log M500 [M a ] 

Fig. 3. Stellar vs total mass. As Fig. 1, but for stellar masses 
measured within r2oo- 



As for rsoo, the intercept and slope marginals are well ap- 
proximated by Gaussians, and the (posterior) probability distri- 
bution of the intrinsic scatter cr scat is an exponentially decreasing 
function with posterior mean 0.05 + 0.05, median 0.03, and 90% 
upper limit of 0. 12 dex. Also in this case, the data are adequately 
described by the model (Bayesian p-value: 30%). 

Compared to the relation found using stellar masses mea- 
sured within the smaller radius rsoo, we found an identical slope, 
a larger normalization (because stellar masses are integrated in 
larger apertures), and a somewhat looser constraint on the in- 
trinsic scatter (expected because we are now computing stellar 
masses in extrapolated r2oo values). Therefore, the tight scaling 
does not seem to be a unique feature of the rsoo radius. 

The slope, derived here from hydrostatic masses within rsoo, 
is in agreement with what was found in A10 for a much larger 
sample of 52 clusters using dynamical masses within r2oo- The 
latter sample is not restricted to clusters appearing to be re- 
laxed in X-ray (unlike most samples dealing with Yx) and con- 
sists of a random sampling of an X-ray flux limited sample 
(as detailed in A10 and also shown by a numerical simula- 
tion in Andreon & Berge 2012). A10 uses caustics and veloc- 
ity dispersion-based masses within r2oo and found a slope of 
0.45+0.08 and 0.53 ±0.08 vs 0.38+0.07 found in this work. The 
agreement of the derived slope between hydrostatic-, caustics-, 
and velocity dispersion-based masses indicates the absence of 
a gross tilt between the three mass scales. It also indicates that 
the current slope is not driven by clusters at the extreme of the 
mass range. The A10 large sample also offers the opportunity of 
checking if the found small cr scan is an unwanted consequence 
of the small sample size. To this aim, we recomputed the poste- 
rior distribution of <T SC att by forcing the slope to be 0.45 + 0.08 
(i.e. adopting the latter as prior, in place of the originally adopted 
Student-f prior) and found an indistinguishable posterior proba- 
bility distribution. This indicates that the small scatter is not due 
to overfitting. 

The scatter we found in the present work is lower than, al- 
though consistent with at 95%, the scatter found in A10 using 
noisier masses (0.14 vs 0.05 dex errors for masses within r2oo)- 
As mentioned, intrinsic scatter values smaller than the mass er- 
rors (like those in A 10) should be read as upper limits, hence 
increasing the agreement. Indeed, A10 assumed Gaussian errors 
(likelihood function) for mass, whereas the later work by Serra 
et al. (2011) found errors to be asymmetric (i.e. the likelihood 
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is not Gaussian). If the uncertainty in the likelihood shape and 
in the error noisiness were accounted for, a larger uncertainty on 
the intrinsic scatter would have been found by A 10, improving 
the current (95%) agreement. 

More in general, imprecise measurements of mass (i.e. with 
» 0.05 dex errors) are the probable reason why the tight stellar 
mass vs halo mass has not been noted before: imprecise errors 
not accounted for in the statistical analysis tend to leave a resid- 
ual scatter of the order of a fraction of the mass error, which the 
fitting algorithm attributes to the intrinsic scatter term. 

3.1. The usefulness of stellar masses for cluster samples 

How does the scatter in stellar mass at a given halo mass com- 
pare to the scatter of other mass proxies? In this comparison, we 
strictly follow Sect. 3.3.4 of the Allen et al. (201 1) review paper. 

- the 90% upper limit of the scatter of Yx at a given mass is 
0.06 dex (Mantz et al. 2010a) above IgM ~ 14.5 M Q (vs 0.06 dex 
for stellar mass within rsoo). Other works (Arnaud et al. 2007, 
Kravtsov et al. 2006) report compatible point estimates, but with 
unspecified uncertainties. 

- the upper limit of Mantz et al. (2010a) to the scatter of the 
core-excised X-ray luminosity at a given mass is $ 0.04 dex (at 
an unspecified level) vs 0.06 dex for stellar mass (90% upper 
limit). 

To summarise, the stellar mass is a mass proxy with a scat- 
ter similar to best-mass proxies. Furthermore, the range in mass 
in which the relation holds extends to lower masses than other 
proxies, which is advantageous from a cosmological perspec- 
tive because clusters of lower mass are far more abundant than 
more massive clusters. On the other hand, one should not jump 
to conclusions: the relative merit of different mass proxies can- 
not be estimated by any single number (e.g. intrinsic scatter). 
One should also consider the slope of the mass-proxy relation 
(stellar mass has a shallower slope than core-excised X-ray lu- 
minosity and Yx) and the size of the sample for which the mass 
proxy can be measured (stellar mass has probably the largest 
sample size, being measurable from survey data, see Sect. 3.3), 
etc. Nevertheless, one of the parameters in cosmological esti- 
mates using galaxy cluster counts and using stellar mass as mass 
proxy is now known and found to be tiny at most. 

Our sample size, 12 clusters, is small. However, Vikhlinin et 
al. (2009b) measured the scatter with only four additional clus- 
ters, the widely cited small Yx scatter of Vikhlinin et al. (2006) 
and Arnaud et al. (2007) is derived analysing smaller samples, 
and Kravtsov et al. (2006) proposed the Yx mass proxy with only 
four more (simulated) clusters. We have already commented that 
the scatter remains small even forcing the slope to be the one 
derived from a much larger cluster sample (with noisier mass 
errors). Mantz et al. (2010a,b) have a much larger sample, but 
their M500 are obtained via a complex and indirect path passing 
through f gas (see their Sect. 2.2.3). 

Similarly, concerns that the scatter is found small because 
of overestimated halo mass errors should be allayed by the fact 
that: a) there is some overlap among cluster samples; four of our 
clusters are in common with Vikhlinin et al. (2006), three with 
Vikhlinin et al. (2009b, they removed MKW4) and that all halo 
mass errors, for common and uncommon clusters, have been de- 
rived by a single team in the same way; and that b) even assum- 
ing that mass errors are zero (an unbelievably small value) in 
order to maximally boost the intrinsic scatter, its posterior is al- 
most unchanged, i.e. stellar mass compares as favorably as the 
best-mass proxies even in this implausible case. Finally, our nu- 
merical simulation in Sect. 2 shows that our stellar mass errors 



are accurate, and thus we exclude that the small cr scatt are due to 
inaccurate (overestimated) errors on stellar mass. 

Two comments about cluster physics are in order. First, a 
small scatter between stellar mass in red galaxies and halo mass 
is expected if during cluster build-up at most a small fraction 
of galaxies is added to the existing red population, i.e. if the 
fraction of galaxies that turns from blue to red is small. This is 
likely the case because the fraction (in number) of blue galaxies 
is small in clusters at all redshift (Raichoor & Andreon 2012; 
Andreon et al. 2006). Moreover, blue galaxies have, on average, 
lower masses than red ones, and thus their contribution to the 
total stellar mass is negligible (e.g. Fukugita et al. 1998; Girardi 
et al. 2000). The found small scatter is instead at variance with 
numerical simulations (e.g. Young et al. 2011). However, these 
fail to reproduce, by a large factor, both the stellar fraction and 
the amount of intracluster light, indicating that probably sub-grid 
physics is not yet accurately known/implemented. 

Second, since the intrinsic scatter of the stellar mass frac- 
tion is negligible (all clusters have the same stellar mass at a 
given cluster mass), the known observed intrinsic scatter in the 
gas fraction (Sun et al. 2009, A 10) cannot be compensated by 
a correlated scatter in stellar mass fraction to keep constant the 
total baryon fraction. 

3.2. The usefulness of stellar masses for individual clusters 

In the previous section, we computed the scatter in observable at 
a given mass, o~ proxy \M- As mentioned in the introduction, cosmo- 
logical estimates need p{proxy\M), whose minimal parametrisa- 
tion is given by a Gaussian with <x = <T proxy \M (see, e.g. Allen 
et al. 2011 or Weinberg et al. 2012 reviews). In fact, Vikhlinin 
et al. (2009), Mantz et al. (2010b), and Rozo et al. (2010) used 
o~pmxy\M in their cosmological estimates using cluster counts. For 
the stellar mass, we found in the previous section a 90% upper 
limit of 0.06 dex on cr proxy \ M . 

The inverse scatter, cr M \ pro „, is also interesting. As men- 
tioned in the introduction, it is the minimal error of the estimated 
mass of a given cluster. As detailed in Andreon & Hurn (2010), 
the error in the proxy itself, and of course, its availability, limit 
the usefulness of a given proxy. We computed the inverse scatter 
as in previous section, fitting the same data set, but using stellar 
mass as the predictor variable. We found 

log M 500 = (2.02 ± 0.35) (lgM* >rm - 12.5) + 14.39 + 0.09 . 

The posterior probability distribution of the intrinsic scatter 
cigM 5aa \M„ is an exponentially decreasing function, indicating that 
the scatter is too small to be determined with the data on hand, 
with mean 0.08 dex, median 0.04 dex, and 90% upper limit 0.2 
dex. These numbers quantify the uncertainty of the mass of a sin- 
gle cluster predicted from its lgM+j 5m , i.e. its optical luminosity, 
if all other sources of errors (the above proxy-mass calibration 
and the proxy uncertainty) are negligible. However, this is un- 
likely to be the case of stellar masses for clusters with IgM 5= 14 
M Q , which are also the majority in our sample. This is because 
they have a typical 0.1 dex error in stellar mass (Table 1), which 
propagates into a 0.2 dex error in the predicted mass. The poste- 
rior predictive mass uncertainty, which encapsulates both proxy 
errors and errors coming from the proxy-mass calibration, turns 
out to be, on average, 0.32 dex for these not massive systems. 
For our two most massive systems, the predictive mass uncer- 
tainty is 0.22 and 0.25 dex, whereas for the very rich MS 1054 
cluster discussed in next section it is 0.21 dex. 

These uncertainties are larger than those one may obtain 
from high-quality X-ray-based mass estimates, such as Yx or 
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Fig. 4. Depth and wavelength coverage of DES and Eclid sur- 
veys. Upper panel: g,r,i, and z (from left to right) filters are in- 
dicated with thin dashed (blue, green, red and black) lines, riz, y, 
J, and H (from left to right) Eclid filters are indicated with thick 
solid (blue, green, red, and black) lines. The horizontal tick in- 
dicates the ~ lOcr depth. Bottom panel: Wavelength coverage of 
the filters for redshift bins, where galaxies are brighter than the 
~ 10cr depth. The shaded (yellow) region marks the wavelength 
sampling of g - r at z < 0.08. 



core-excised X-ray luminosities. We emphasise, however, that 
the value of a mass proxy also depends on its availability and 
that obtaining precise estimates of Yx or core-excised X-ray lu- 
minosities requires expensive, even unfesable, follow-up obser- 
vations from space. The value of a mass proxy also depends on 
how well the mass-proxy relation is calibrated. The Yx and core- 
excised X-ray luminosity vs mass relations are un-calibrated be- 
low IgM — 14 - 14.5 M G , the precise value depends on which 
mass proxy is considered. This is a mass quite large mass value 
in the cluster mass function and even more so at intermediate and 
high redshift. Therefore, stellar masses are a useful mass proxy 
for clusters of not large mass or when one may not afford the lux- 
ury of having the needed high-quality follow-up observations to 
derive more precise predicted masses. 

Strategies to reduce the size of stellar mass errors and thus to 
increase its quality as mass estimator for individual clusters, are 
being investigated. 

3.3. On which part of the Universe is stellar mass 
measurable with current data? 

The analysis of previous sections uses g — r colour and lumi- 
nosities of galaxies brighter than r < 19 in the nearby Universe 
(z < 0.08). Figure 4 illustrates how these constraints change with 



redshifts. The top panel illustrates the apparent luminosity of a 
red z = 0.08 r — 19 galaxy, modelled as a zj = 5 single stel- 
lar population using the 2007 version of the Bruzual & Chariot 
(2003) synthesis population model, for different filters: g, r, i, 
and z for the Dark Energy Survey (Abbott et al. 2005) and riz, Y, 
J, and H for Euclid (Laureijs et al. 2011) with the correspond- 
ing ~ 10<x depth (horizontal ticks). The bottom panel illustrates 
the A range sampled by these filters. Only redshift bins where 
galaxies are brighter than the 10 cr depth are plotted. The shaded 
yellow is the A range sampled by g - r at z < 0.08. As the figure 
shows, we always have two filters in the shaded region, i.e. up to 
z = 0.9, these data have the depth and wavelength coverage ap- 
propriate to measure stellar masses as we did at z < 0.08. Indeed, 
an even larger redshift range is accessible for the massive clus- 
ters, because the depth requirement can be safely relaxed, as it is 
not necessary to accurately measure stellar masses for these ob- 
jects. We further emphasise that these depths, which cover sev- 
eral thousands of deg 2 , are quite comparable to the depths of 
surveys available right now covering a ten of deg 2 , such as the 
deep fields of the CFHTLS (Cuillandre & Berlin 2006), VISTA- 
VIDEC0 (Jarvis et al. 2012), or WIRDS (Bielby et al. 2011). 
Repeating this exercise with the planned Large Sinoptic Survey 
Telescope 10 yr integration (Ivezic et al. 2008), we found that 
Z = 1.2 may be easily reached. 

In order to use stellar masses for large cluster samples, the in- 
tegral of the luminosity function computation should not include 
any manual step or complementary (e.g. spectroscopic) data. In 
our analysis of nearby clusters, we removed by hand saturated 
stars misclassified as galaxies and used spectroscopy to iden- 
tify foreground galaxies brighter and larger than the BCG. The 
first step can be automatically implemented by using the central 
object intensity (or any flag based on it). The second step may 
use galaxy sizes and photometric redshifts in place of spectro- 
scopic redshifts that are perhaps not available. Therefore, auto- 
matic computation is possible for large cluster samples. 

In order to test whether the increased galaxy background at 
high redshift may be detrimental for an accurate measurement 
of the stellar mass, we computed the stellar mass of two high 
redshift clusters using real data. 

We first consider MS 1054 (Gioia et al. 1990) at z = 0.83, 
a massive cluster of galaxies (IgMyx, ta 14.8 M , Jee et al. 
2005) for which we consider only V < 24.7 mag (to match 
depths) and V — I colour, coming from FIRES (Labbe et al. 
2003; Forster Schreiber et al. 2006), which was already used 
in Andreon (2006) to measure the luminosity function of its red 
galaxies (see his Figure 4). Figure 5 shows the V-I vs / diagram 
of galaxies in the cluster line-of-sight (top panel) and in a refer- 
ence line-of-sight (bottom panel). As is fairly obvious, the back- 
ground contribution, depicted in the bottom panel, at the colour 
of red galaxies (inside the dashed lines), is negligible. Our lumi- 
nosity function determination of red galaxies of several clusters 
(e.g. of the 28 clusters in Andreon 2008, up to z = 1.3) observa- 
tionally confirms that a low background can be easily achieved 
by adopting a colour index bracketing the 4000 A break and 
that the background is not a large penalty for red galaxies up 
to the highest redshifts. Furthermore, the background contami- 
nation can be further reduced using photometric redshift. This 
situation extends up to the largest redshifts: Andreon (2011b) 
found that there are on average 3.6 back/foreground red galaxies 
in the z ~ 2.2 JKCS041 cluster (Andreon et al. 2009, Andreon 
& Huertas-Company 201 1) line-of-sight from measurements all 
around the cluster vs 23 galaxies on the red sequence. We com- 



http://star-www.herts.ac.uk/%7Emjarvis/video/index.html 
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Fig. 5. Colour-magnitude relation in the cluster (top panel) and 
control field (bottom panel) line-of-sight directions. The solid 
line marks the colour-magnitude relation, whereas the dotted 
lines delimit the portion of plane that qualifies galaxies to be 
called red. The vertical line marks the considered limiting depth. 
In this figure, we use magnitudes in the Vega system, for com- 
parison with the similar figures in Andreon (2006). 



puted the MS 1054 stellar mass as we did for lower redshift clus- 
ters and found an error of 0.04 dex. The error is small because of 
the large richness of MS 1054 and the low background contami- 
nation at the red sequence colour. 

As a worse case scenario, we consider RzCS052 at z = 1 .016 
(Andreon et al. 2008a,b). The cluster is still a massive one 
(lgM 50 Q a 14.4 M Q , Andreon et al. 2008a), barely detected in 
~ 10 ks XMM observations^. The cluster is at a redshift larger 
than the maximal one that we conservatively said possible for 
stellar mass computation, and even more so with the available 
data (from Andreon et al. 2008b), which are shallower (by a 
few tenths of mag) than the DES+Euclid surveys, and made 
the derivation of stellar mass challenging. The colour-magnitude 
plot and luminosity function of the red galaxies of RzCS052 are 
depicted in Fig 2 and 4 of Andreon et al. (2008b). We computed 
the stellar mass as we did for lower redshift clusters and found 
an error of 0.1 dex, in spite of having taken a cluster outside 
the redshift range claimed to be accessible, having used data of 
lower depth than upcoming surveys, and having discarded the 
multicolour data available for this cluster (and for the consid- 
ered surveys). 

As for other mass proxies (e.g. Yx and Lx), the quoted er- 
ror assumes to perfectly know the evolution of the intercept of 



There is a typo into the exponent of the cluster flux reported in 
Andreon et al. 2008a: the correct flux is 1.2 10~ 14 erg -1 s -1 cnT 2 , as 
one may discern from the quoted Lx- 



the relation between mass and mass proxy. In the case of stellar 
masses, the evolution is modulated by the M/L ratio of red galax- 
ies, which is robustly known (e.g. Treu et al. 2005, Holden et al. 
2010) up to at least z ~ 1. Instead, the evolution of other mass 
proxies is poorly known (e.g. the evolution of Yx or Lx with z, 
Andreon, Trinchieri & Pizzolato 2011). 



4. Summary 

The analysis of a sample of clusters of the nearby Universe 
(z < 0.08), which is of low cardinality because of the need of 
high-quality mass estimates (0.05 dex mass errors), shows that 
stellar mass, derived measuring the luminosity emitted by red 
galaxies only, has a tiny scatter with halo mass, similar to best- 
mass proxies. We show that stellar masses can be measured with 
survey-quality data (optical and near-infrared imaging) typical 
of current (on some square degrees) and upcoming (on several 
thousand of square degrees) surveys up to z = 0.9 - 1.2. 

Three-quarters of the studied clusters have IgM % 14 M Q , 
which is advantageous from a cosmological perspective because 
these clusters are far more abundant than more massive clus- 
ters. In constrast to other mass proxies, we have robust knowl- 
edge about the evolution of the stellar mass proxy at a given 
halo mass. This is because the mass evolution is modulated by 
the evolution of the M/L of red galaxies, which is observation- 
ally well constrained up to at least z ~ 1 . We emphasise, how- 
ever, that a small scatter is an essential property for a good mass 
proxy, but this number alone cannot completely characterise a 
mass proxy: availability of the mass proxy and its error also are 
two essential properties. We also emphasise that if the purpose 
of a mass proxy is to derive the mass of an individual cluster 
(as opposed to a sample), survey-based stellar masses should be 
viewed as a complementary mass proxy to be used when one 
may not afford the luxury of having non-survey high-quality 
masses requested for better mass proxies, such as Yx or core- 
excised luminosities, or when these measurements are infeasible 
or uncalibrated. 

Clusters in our sample display an intrinsic scatter in gas frac- 
tion (Sun et al. 2009, Andreon 2010) and a tiny scatter in stellar 
fraction. Therefore, the latter cannot compensate for the scatter 
of the former to keep the total baryon fraction constant. 
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